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8 Abstract 

9 A new empirical algorithm is proposed to estimate surface chlorophyll-a concentrations (Chi) in 

10 the global ocean for Chi < 0.25 mg m" 3 (~ 77% of the global ocean area). The algorithm is based 

11 on a color index (Cl), defined as the difference between remote sensing reflectance (R IS , sr' 1 ) in 

12 the green and a reference formed linearly between R rs in the blue and red. For low Chi waters, in 

13 situ data showed a tighter (and therefore better) relationship between Cl and Chi than between 

14 traditional band-ratios and Chi, which was further validated using global data collected 

15 concurrently by ship-borne and SeaWiFS satellite instruments. Model simulations showed that 

16 for low Chi waters, compared with the band-ratio algorithm, the Cl-based algorithm (CIA) was 

17 more tolerant to changes in chlorophyll-specific backscattering coefficient, and performed 

18 similarly for different relative contributions of non-phytoplankton absorption. Simulations using 

19 existing atmospheric correction approaches further demonstrated that the CIA was much less 

20 sensitive than band-ratio algorithms to various errors induced by instrument noise and imperfect 

21 atmospheric correction (including sun glint and whitecap corrections). Image and time-series 

22 analyses of SeaWiFS and MODIS/Aqua data also showed improved performance in terms of 

23 reduced image noise, more coherent spatial and temporal patterns, and consistency between the 

24 two sensors. The reduction in noise and other errors is particularly useful to improve the 

25 detection of various ocean features such as eddies. Preliminary tests over MERIS and CZCS data 

26 indicate that the new approach should be generally applicable to all existing and future ocean 

27 color instruments. 
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32 1. Introduction 

33 Over the past half century, algorithms to invert ocean color (i.e., spectral radiance or reflectance 

34 of the surface ocean) to phytoplankton chlorophyll-a concentrations (Chi in mg m' 3 ) have 

35 evolved from simple empirical regressions [Gordon and Morel, 1983] to semi-analytical 

36 inversions based on radiative transfer theory [Sathyendranath et al., 1989; Carder et al., 1999; 

37 Maritorena et al., 2002; others] . While each of these has its own advantages and disadvantages 

38 (and thus, applicability range), an algorithm based on a spectral ratio of remote-sensing 

39 reflectance ( R IS , sr" 1 ) has historically been used as the default algorithm formulation to produce 

40 global chlorophyll-a products from measurements made by satellite instruments. These include 

41 the Coastal Zone Color Scanner (CZCS, 1978-1986), the Sea-viewing Wide Field-of-view 

42 Sensor (SeaWiFS, 1997-2010) and the Moderate Resolution Imaging Spectroradiometer 

43 (MODIS, 1999 - present for Terra satellite and 2002 - present for Aqua satellite). The current 

44 default Chi algorithm for SeaWiFS and MODIS is based on the OCx form of O’Reilly et al. 

45 (2000), with coefficients derived using in situ data from the NASA bio-Optical Marine 

46 Algorithm Dataset (NOMAD) version 2 

47 [http://oceancolor.gsfc.nasa.gov/REPROCESSING/R2009/ocv6/] . The default SeaWiFS 

48 algorithm is referred to as OC4 in this paper. Correspondingly, many large-scale studies of ocean 

49 carbon cycles and biogeochemistry that utilized satellite ocean color data, from regional-, basin-, 

50 to global-scale, have used the OC4 data products [e.g., Gregg et al., 2005; Behrenfeld et al., 2006; 

5 1 Yoder and Kennedy, 2006; Polovina et al., 2008] , leading to documented changes in Chloc 4 and 

52 primary productivity at various spatial/temporal scales and connections to climate variability. 

53 An early review on the history of the band-ratio empirical algorithms as well as their advantages 

54 and disadvantages was provided in Gordon and Morel [1983] , and recently re-visited by Dierssen 

55 [2010] . Briefly, the most recent OC4v6 algorithm evolved from its predecessors in the 1970s and 

56 1980s [Clarke et al., 1970; Arvesen et al., 1973; Hovis and Leung, 1977, Clark et al., 1980; 

57 Gordon and Clark, 1980; Morel, 1980] , when the radiance ratio of blue and green wavelengths 

58 was recognized to correlate well with surface Chi. The underlying assumption is that the relative 

59 changes between the blue and green bands are primarily driven by changes in phytoplankton and 

60 their direct degradation products (i.e., the traditional Case-I scenario, see Morel and Prieur, 

61 1977), and the latter can therefore be inferred from the former. Indeed, despite the various 

62 studies showing the algorithm artifacts in non-Case-I (i.e., Case-II) waters [e.g., Dierssen et al.. 
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63 2002; Hu et al., 2003; Odriozola et al., 2007; others] , global validation efforts of the SeaWiFS 

64 Chloc 4 data products proved that for most open ocean waters, the algorithm performed well, with 

65 RMS differences from ship-based Chi (after logarithmic transformation) of 0.2 - 0.3 without 

66 significant bias [Gregg and Casey, 2004; McClain et al., 2004; McClain, 2009] . 

67 Agreement/disagreement varied among different ocean basins because the same regression 

68 coefficients, determined from the global dataset optimization, were applied universally [Gregg 

69 and Casey, 2004] . To address these regional differences, various band combinations and 

70 regression coefficients were developed for different water types [e.g., Kahru and Mitchell, 1999; 

71 McKee et al., 2007; Mitchell and Kahru, 2009], with similar band-ratio forms. 

72 All previous global- scale studies used spatially and temporally composited data (e.g., monthly 

73 composites at reduced resolution) to reduce data volume and fill in data gaps due to cloud cover 

74 and other measurement/algorithm artifacts. Chi data product errors at original spatial and 

75 temporal resolutions are smoothed and smeared in these higher- level data products, thus 

76 complicating the propagation of errors to trend/variability analyses at global or regional scales. 

77 These errors are particularly evident at low concentrations (Chi < 0.1 mg m 3 ). Fig. la shows a 

78 typical example of SeaWiFS Global Area Coverage (GAC) Level-2 Chi data product for the 

79 Sargasso Sea, an oligotrophic ocean gyre in the North Atlantic. Due to a variety of reasons (see 

80 details below), the image shows patchiness, speckle noise (pixelization), and is not spatially 

81 coherent. Note that all non-zero Chi values in this image are regarded as acceptable-quality and 

82 used in composing the higher-level (i.e., lower spatial and temporal resolution) products, because 

83 all “low-quality” data, as defined by the various quality flags, are already discarded. The image 

84 was selected rather arbitrarily for demonstration purpose, and similar problems could be 

85 visualized in almost every Level-2 GAC image. Clearly, these issues need to be addressed in 

86 order to understand how they may propagate to higher-level products to affect the large-scale 

87 trend/variability analyses. 

88 Recently, to derive spatially coherent and temporally consistent ocean color patterns from 

89 satellite images contaminated by severe sun glint, a new color index (Cl) was developed for 

90 satellite ocean color observations ([Hu, 2011]. Instead of using a blue-green band-ratio as the 

91 independent variable, the Cl is calculated as the difference between the green-band reflectance 

92 and a reference formed linearly by the blue and red bands. This is similar to the design of the 

93 MODIS fluorescence line height (FLH, Letelier and Abott, 1996) and MERIS maximal 
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94 chlorophyll index (MCI, Gower et al., 2005), except that the bands are shifted to blue-green-red. 

95 Hu [2009] used a similar form to detect and quantify the reflectance peak in the MODIS 859-nm 

96 band, and proved that the floating algae index (FAI), derived using the 645-859-1240 band 

97 combination, was much less sensitive to variable observing conditions (aerosols, sun glint, thin 

98 clouds, solar/viewing geometry) than band-ratio algorithms. The MODIS Cl appears to be 

99 relatively insensitive to residual errors due to imperfect empirical glint correction, and in glint- 

100 free areas it is also well correlated with MODIS band-ratio Chi [Hu, 201 1], suggesting that a new 

101 Chi algorithm might be developed to remove residual atmosphere-correction related errors and 

102 image noise. 

103 Inspired by these recent works, a new empirical algorithm to retrieve Chi using the Cl as the 

104 independent variable is developed and validated in this paper. Using data collected primarily by 

105 SeaWiFS but also by MODIS/Aqua and other satellite instruments, we evaluate the performance 

106 of such a band-difference algorithm (i.e., the Cl algorithm or CIA) in comparison with the OC4 

107 band-ratio algorithm. We demonstrate and argue that because the Cl is much more tolerant than 

108 the band ratio to various perturbations in sensor hardware and data processing (e.g., instrument 

109 noise, residual errors in atmospheric correction, whitecap and sun glint corrections, stray light 

110 contamination), and also more tolerant to perturbations of Chl-independent particle 

111 backscattering from the water column, the CIA appears superior to band-ratio algorithms in 

112 deriving a more consistent and accurate Chi climate data record for most oligotrophic oceans. 

113 The paper is arranged as follows. The principles to “measure” Chi from space, although found in 

114 the refereed literature, are briefly introduced for the reader’s convenience. The in situ and 

115 satellite data used to develop and validate the new algorithm are then described. Following that, 

116 the new Chi algorithm (CIA) is described and validated for SeaWiFS. Its sensitivity to errors and 

117 perturbations, in comparison with the OC4 algorithm, is analyzed in detail, and further 

118 demonstrated using satellite measurements. Sample time-series at several arbitrarily selected 

119 oligotrohpic ocean sites as well as from global-scale data are used to evaluate the performance of 

120 the new algorithm. Finally, we discuss the new algorithm’s applicability to other satellite 

121 instruments such as MODIS, MERIS, and CZCS, and discuss its potential to improve data 

122 quality, time-series and cross-sensor consistency, and to improve image quality in feature 

123 detection. 

124 
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2. Principles to “measure” Chi from space 

A multi-band ocean-color satellite instrument measures the top-of-atmosphere (TOA) radiance or 
reflectance in several spectral bands coving the visible to the near-infrared domain. On SeaWiFS, 
the spectral bands are centered at X=41 2, 443, 490, 510, 555, 670, 765, and 865 nm, respectively. 
After radiometric calibration (including in-orbit vicarious calibration, Franz et al., 2007) the 
calibrated at-sensor reflectance (pA)) is used to derive the at-sea remote sensing reflectance (R rs ) 
[Gordon, 1997], With some simplifications, this can be expressed as: 

pA) = pA) + p«A) + A)pv*A)+7A)piA) + nt(k)t 0 (k)RM, (1) 

where p, is that due to Rayleigh scattering, p ar is that due to aerosol scattering and aerosol- 
Rayleigh interactions, p wc is the whitecap reflectance, p g is the sun glint reflectance, T and t are 
the direct and diffuse transmittance from the target (pixel of the imagery) to the sensor (satellite), 
and to is the diffuse transmittance from the sun to the target. 

Deriving R,- S (~k) from pA) is through a sophisticated atmospheric correction, which uses lookup 
tables for aerosol and molecular properties [Gordon and Wang, 1994a&b, Ahmad et al., 2010, 
Bailey et al., 2010] after removing contributions from whitecaps [Frouin et al. 1996] and sun 
glint [Wang and Bailey, 2001]. The retrieved R n Ck) is then used as the input to an established 
bio-optical inversion model to derive Chi. For the OC4 algorithm, Chi is derived as [O’Reilly et 
al., 2000]: 

ChloC4 =10 a ° + a1 *^ + a2 *X*X + a3*x*X*X + a4*x*X*X*X 

X = log 10 (R) and R = max(R„(443, 490, 510))/R„(555), (2) 

where aO - a4 are the empirical regression coefficients, for which the current values (version 6) 
are 0.3272, -2.9940, 2.7218, -1.2259, -0.5683, respectively. 

The algorithm details and their performance at global and regional scales can be found in the 
published literature as well as in online documents 

(http : //oceancolor . gsfc .nasa .go v/REPROCES S IN G/R2009/ oc v6/) . 

3. Data sources used in this study 

In situ data were obtained from the NASA SeaBASS archive (SeaWiFS Bio-optical Archive and 
Storage System), which is a database of measurements collected by many research groups in 
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154 order to develop and validate satellite ocean color algorithms. The NOMAD dataset, described 

155 by Werdell and Bailey [2005] , is a subset of SeaBASS specifically compiled for bio-optical 

156 algorithm development, as it contains coincident measurements of Chi, R rs (k), and other data 

157 collected simultaneously in the global oceans. 

158 Like the current OC4 algorithm, the dataset used to develop the CIA was taken from NOMAD 

159 version 2, covering a period of 1991-2007 and containing 4459 data records. Similar to Morel et 

160 al. [2007a] , the NOMAD data used in the present study for algorithm development are those with 

161 Chi determined via HPLC, since Chi determined from traditional fluorometric methods often 

162 suffer from contaminations by chlorophyll-6 and chlorophyll-c, as demonstrated from data 

163 collected in the southern ocean [Marrari et al., 2006; Dierssen, 2010] . Further, we applied the 

164 following criteria to select data for the oligotrophic oceans: R rs (k ) >0.0 sr' , Chi >0.0 mg m' , 

165 bottom depth > 30.0 m, and latitude between 60°N and 60°W. A total of 136 data records were 

166 obtained. 

167 To evaluate the algorithm performance when applied to satellite data, in situ data were also 

168 obtained from the SeaBASS archive through online query. The following criteria were used to 

169 search for the in situ - satellite matching pairs: bottom depth > 30 m; solar zenith angle < 70°; 

170 satellite zenith angle < 56°, time difference between satellite and in situ measurements < 3 hours; 

171 satellite Chi variance (standard deviation divided by mean) from the 3x3 pixels centered at the in 

172 situ stations < 15%; difference between modeled and measured surface irradiance < 100%; wind 

173 speed < 35 m s' 1 . For SeaWiFS, a total of 1424 matching pairs were obtained for 1998-2010. 

174 The online query also resulted in the satellite Level-2 computer filenames corresponding to the 

175 matching pairs. These Level-2 data products were derived by the NASA Ocean Biology 

176 Processing Group (OBPG) using the most recent updates in algorithms and instrument 

177 calibration (Reprocessing 2010.0, SeaDAS6.1). The data products include Chloc 4 ? aerosol optical 

178 thickness at 865 nm (t_865), and R rs (k). R rs (^) data extracted from the Level-2 files were used as 

179 the input to derive Chlci (Chi from the Cl algorithm) and compared with those determined from 

180 the in situ measurements. 

181 To evaluate algorithm performance in constructing time series, SeaWiFS Level-2 data between 

182 1998 and 2010 covering two oligotrophic gyres, namely in the Sargasso Sea (15 to 35°N, 60 - 

183 40°W) and in the eastern South Pacific Gyre (20 to 40°S, 120 to 100°W), were obtained from the 
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184 NASA GSFC. For cross-sensor consistency evaluations, SeaWiFS and MODIS/Aqua Level-2 

185 global daily data for 2006 were used. Some Level-2 data files from MODIS/Aqua, MERIS, and 

186 CZCS covering the western North Atlantic Sea were also used for algorithm evaluation. 

187 4. The new empirical Chi algorithm 

188 Similar to the MODIS Cl from the Rayleigh-corrected reflectance [Hu, 201 1] , the R rs -based 

189 SeaWiFS Cl is defined as the relative height of R rs (555) from a background, i.e., difference 

190 between R rs (555) and a baseline formed linearly between R rs (443) and R rs (670) (Fig. 2): 

191 Cl = R rs (555) - [R rs (443) + (555-443)/(670-443)*(R rs (670)-R rs (443))] , (3) 

192 which is approximately Cl « R rs (555) - 0.5(R rs (443) + R rs (670)). 

193 By this definition, for most clear ocean waters Cl is negative. Because for most clear waters 

194 R rs (670) is negligible (see the “clear water” concept in Gordon and Clark, 1981 and revisited in 

195 Morel and Maritorena, 2001), Cl is basically a weighted relative difference between R rs (443) and 

196 R rs (555). Just as a ratio between the two is related to Chi, since R rs (555) is relatively stable but 

197 R rs (443) is sensitive to Chi changes for clear waters [Gordon and Morel, 1983], a difference 

198 between the two should also be related to Chi, and this forms the basis of the new Chi algorithm 

199 (the theoretical basis of this algorithm is provided in Section 6.1 below). Indeed, Fig. 2 shows 

200 that with increasing Chi, the magnitude of Cl decreases monotonically. The added band at 670- 

201 nm has a great advantage in compensating various errors in atmospheric correction and other 

202 corrections when the algorithm is applied to satellite data (see below). 

203 Using the NOMAD dataset, the relationships between band-ratio R and Chi (Eq. 2) and between 

204 Cl and Chi are shown in Figs. 3a and 3b, respectively, for data collected from the 136 qualified 

205 stations. Also overlaid on Fig. 3a is the OC4v6 prediction (solid line), which shows that the 

206 globally optimized regression relationship fits well with the low Chi values. If a similar band- 

207 ratio form is developed using the low-concentration stations only (green dots), slightly better 

208 performance can be achieved as measured by the statistics (Table 1), but at the price of 

209 sacrificing the intermediate values (red line in Fig. 3a) because the numerical fit tends to plateau 

210 for Chi around 0.2 and 0.3 mg m" 3 . 

211 The statistical measure of the algorithm performance is listed in Table 1 . Note that when 

212 evaluating the relative difference between the two datasets x and y (in this case, one is the in situ 
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213 measurement (x) and the other is the algorithm prediction (y)), RMS difference (or error) is 

214 typically evaluated using the form of (y-x)/x. However, when one dataset contains substantial 

215 errors, the (y-x)/x ratio may be extremely large and therefore create biased estimates for the 

216 relative difference. For this reason, an “unbiased” RMS was also estimated using the form of (y- 

217 x)/(0.5x + 0.5y). And this evaluation was also used for comparison between satellite and in situ 

218 Chi data below. When the Chi data cover a large dynamic range they tend to be log-normal 

219 [Campbell, 1995] . Thus, R 2 between the log-transformed data was also estimated and presented 

220 in Table 1 . 

221 Fig. 3b shows that for low Chi values there is a strong relationship between Cl and Chi, 

222 confirming the visual interpretation of Fig. 2. Non-linear regression for Cl < -0.0005 resulted in a 

223 coefficient of determination (R 2 ) of 0.95 (N=50) and a RMS difference of 16.52% between the 

224 Cl-predicted Chi (Chl C i) and the measured Chi: 

225 Chlci= 10 -o.4909 + 191.6590 *CI [ CI < -0.0005 sr" 1 ] (4) 

226 In comparison, for the same data points corresponding to Cl < -0.0005 sr" 1 (N=50), the OC4v6 

227 predicated Chi showed a lower coefficient of determination (R 2 = 0.85, N=50) and higher RMS 

228 difference from the in situ Chi (RMS = 34.87%). Even when new coefficients from these low- 

229 Chi data points were tuned to result in a better fit between band-ratio R and Chi, RMS difference 

230 was reduced to 22.95% but still higher than the Cl predictions (Table 1). Indeed, the contrast 

231 between the different data scattering in Fig. 3a for Chl 0 c 4 and Fig. 3b for Chl C i is apparent. From 

232 this regression alone, the CIA appears to perform better than the OC4v6 for low concentrations 

233 (Chi < 0.25 mg m" 3 ). Note that although the number of data points used in the regression is 

234 limited (N=50), they were collected from different ocean basins (Fig. 3a inset) covering the 

235 Pacific, Atlantic, Gulf of Mexico, and the Southern Ocean. Thus, the CIA might be applicable 

236 for most oligotrophic waters. 

237 Fig. 3b also shows that the CIA may only be applicable for low concentrations, because the 

238 relationship quickly falls apart for Cl > 0.0005 sr" 1 , corresponding to Chlci ~ 0.4 mg m" 3 . The 

239 reason why the CIA does not work well above this concentration is demonstrated in Sections 6.1 

240 and 6.2 using radiative transfer modeling. Indeed, above this concentration, the CIA tends to 

241 underestimate Chi significantly (Fig. 3b), where the original OC4v6 should be used instead. For 

242 intermediate concentrations a mixture between the two algorithms may be used to assure image 
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243 smoothness when the algorithm switches from one to another. For this practical consideration, 

244 the new global product of chlorophyll (Chloci) is defined as follows: 

245 Chloci = Chla [for Chl a < 0.25 mg m' 3 ] 

246 Chloc4 [for Chlci > 0.4 mg m' 3 ] 

247 axChl 0 c 4 + (3x Chlci [for 0.25 < Chl C i s£ 0.4 mg m' 3 ], (5) 

248 where a = (Chla - 0.25)/(0.4 -0.25), (3 = (0.4 - Chl C i)/(0.4 - 0.25). Because such-derived Chi is 

249 from two algorithms (OC4 and CIA), we use the term Chloci hereafter to represent the merged 

250 product. 

251 

252 5. Validation of the new Chi algorithm 

253 The CIA was implemented to derive Chloci from SeaWiFS Level-2 /?, iV (X) data where concurrent 

254 in situ Chi were found (see data source). Fig. 4 shows the comparison between in situ Chi and 

255 satellite Chloci, and between in situ Chi and satellite Chl 0 c 4 - For high concentrations (Chloci > 

256 0.4 mg m' 3 ) the data points between the two algorithms were forced to be identical (Eq. 5). For 

257 low concentrations (Chi < 0.25 mg m~ 3 ), the Cl algorithm outperforms the OC4 algorithm by all 

258 measures, from RMS difference, R 2 , to mean and median ratios (Table 2). Note that although 

259 only a limited number of data points were available for low concentrations, a slight improvement 

260 in algorithm performance may lead to larger difference in image analysis, because the majority 

261 of the ocean is oligotrophic. Indeed, analysis of the 13-year SeaWiFS monthly data between 

262 1998 and 2010 indicated that 77.8±1 .0% of the global ocean waters had surface Chi < 0.25 mg 

_o _o 

263 m' , and 88.4±1 .4% had surface Chi < 0.4 mg m' . Thus, such a new algorithm might have 

264 profound effects on global-scale studies. Note that if a local OCx algorithm is developed for low 

265 concentrations only (Fig. 3a red line), its performance will also improve over the globally tuned 

266 OC4 algorithm in statistical measures and is also slightly better than the CIA in terms of median 

267 ratio. However, its R value is lower than the CIA, especially when a linear form is used. Global 

268 validation results using this local OCx algorithm showed plateaued performance around 0.2-0 .3 

269 mg m" 3 . More importantly, because it takes a similar band-ratio form, it suffers from same 

270 problems as encountered by the OC4 algorithm for low concentrations (see below). Thus, it is 

27 1 listed in the table for demonstration only and was not implemented for global data processing. 
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272 Because only limited in situ data are available to evaluate algorithm performance at low 

273 concentrations (e.g., there is no in situ Chi < 0.02 mg m~ 3 ), below we take a theoretical approach 

274 to compare the sensitivity of Chl C i and Chl 0 c 4 algorithms to various perturbations, including 

275 sensor noise, atmospheric correction, and non-covarying in-water constituents. 

276 

277 6. Algorithm theoretical basis, and its sensitivity to simulated and realistic perturbations. 

278 6.1. Algorithm theoretical basis: why and when it works 

279 Assuming that the influence of measurement geometry (i.e., bi-directional reflectance effects) on 

280 R rs (k) can be corrected [Morel and Gentili, 1993; Lee et al . , 20 11], R rs Q.) is entirely determined 

28 1 by the inherent optical properties (IOPs) through primarily spectral absorption and 

282 backscattering by the various in- water optically active constituents (OACs). These include water 

283 molecules, phytoplankton, colored dissolved organic matter (CDOM or yellow substance), and 

284 detrital particles. In high- wind seas, the OACs may also include bubbles induced by wave 

285 breaking, which may increase the backscattering properties significantly. Following Lee et al. 

286 [2010] , R rs (k) can be expressed using spectral absorption ( a ) and backscattering (/?/,) coefficients 

287 as: 


288 


R [s a,£2) = 


g;(Q)+g; v (Q) 


/rU) 


*-(A) 


G P (Q) + G P (Q) 


M*)' 




/r(A) 


k{A) 


( 6 ) 


289 where the phase-function effects of molecular and particulate scatterings are separated explicitly. 

290 In Eq. (6), k = a + b\>, while £2 represents the solar/viewing geometry. A simplified form has 

291 often been used in the literature: 


292 


R,M) = G 


bM+b^y 

a(A) 


(7) 


293 where bb w and bi- ip are backscattering coefficients of water molecules (constant) and particles 

294 (variable), respectively. 

295 Because i? rs (670) is generally negligible for oligotrophic waters, Cl from Eq. 3 can be 

296 approximated as 

297 (2a(443)^(555) - a(555) ^(443)) + (2a(443) (555) - a(555) \ (443)) 

2a(443) a(555) , ^ 
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298 


Because bb w (443) ~ 2.6 bb w (555), and bb P (443) ~1 .6 bb P (555) (assuming a spectral slope of 2), Eq. 
8 can be further simplified as 


299 


300 

301 

302 

303 

304 

305 

306 

307 


a c (a(555) -0.8a(443)) ^(443) + (a(555) - 1.3a(443)) b hp (443) _ q + A^, 

2a (443) a(555) 2a(443) a(555) ^ 

Fig. 5 shows the two backscattering related terms (A wat er and A par ticies ? xlOOO) for Chi ranging 
between 0.02 and 1 .0 mg m~ 3 , estimated from the Morel and Moretorina [2001] Case-1 model. It 
shows that for Chi < ~0.4 mg m' 3 , | A wate r | overweighs | A part i C ies | . This is due to two reasons: 1) 
low bb P (443) relative to bb W (443) (e.g., for Chi = 0.1 mg m' 3 , bb W (443) = 0.0025 m' 1 , bb P (443) 
~0.0015 m' 1 ); 2) When Chi increases, the corresponding increase in bb P (443) is compensated by 
the decrease in (a(555)-l .3 a(443)). These results suggest that for Chi < 0.4 mg m' 3 , Eq. 9 can be 
further simplified to 


308 ci a -G ^ (443) 

2a(443) 


( 10 ) 


309 

310 

311 

312 

313 


which is equivalent to the band ratio: 

R„ (443) (443) + b bj , (443) q(555) 

RJ555) “ bj555) + b„(555) a( 443) 


R = 


( 11 ) 


In other words, both CI and R are inversely related to c/(443). Because for oligotrophic waters 
a(443) is primarily a function of Chi, CI in Eq. 10 can be expressed as 


CI °c -G 


Kw (443) 
2/ (Chi) 


( 12 ) 


314 This simplified equation explains why Chi can be derived from CI at low concentrations. 

315 

316 6.2. Sensitivity to perturbations from in-water constituents 

317 The empirical Chi algorithms (either OC4 or CIA) are based on the assumption that R rs Ck) is 

318 mainly determined by phytoplankton and its direct degradation product (the so called ‘Case I’ 

319 waters, Morel and Prieur, 1977) or at least other OACs such as CDOM and detrital particles 

320 covary with phytoplankton. For low concentrations, both band-ratio (R) and CI are inversely 
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related to the total absorption coefficient (a(443), Eqs. 10 and 11), where the contribution of 
phytoplankton and CDOM/detrital particles to a(443) must covary in order to derive the former. 
There has been substantial evidence that the OACs often do not covary even for the open oceans 
[Loisel et al., 2002; Dierssen, 2010], which may explain why a globally optimized 
parameterization in OC4 may work well for one ocean basin or one season but its performance 
can be much worse for another [e.g., Gregg and Casey, 2004]. Thus, for global applications, one 
measure to assess algorithm robustness is to test its sensitivity to various scenarios where OACs 
do not co vary. 

For such a sensitivity analysis, the same approach of Lee et al. [2010] to assess IOP algorithm 
uncertainty was adapted here for both the OC4 and CIA. Synthetic data ( R rs ( X) derived from 
various IOP combinations) were used to evaluate the impact of IOP variability on Chi retrieval 
accuracy. 

Briefly, starting from Eq. (6), the geometric parameters (G 0 M '(Q),G 1 M '(t2) ,G,f (Q), and G/’fQ) ; sr 1 ) 

were taken as (0.0604, 0.0406, 0.0402, 0.1310 sr 1 ) [Lee et al 201 1]. The absorption and 
backscattering coefficients were modeled as: 

«(' l) = aJA) + a ph (A ) + a dg (A ) 

W-hM) +\,U) ’ ,13 ' 


where a w (k) and bi m ,(A) are for water molecules and taken from Pope and Fry ([1997] and Morel 
[1974], respectively. a P h(k), dd g (X), and bi, p (X) are for phytoplankton pigments, detrital particles 
and CDOM, and particulate matter, respectively, and they are modeled as: 

a p M)-a ph (440 )a;,(A) 

<J*a)-a A (440) e - su -"» . (14) 

( 440 Y 

\,W = V 440) — 


Here a ph (A) is a p /,(X) normalized to a p i,(440) and taken from the IOCCG [2006] database. 

The dependence of a ( / g (X) and bi, p (X) on Chi (or a p i,(44Q)) were defined as: 

a d g ( 44 °) = Pi «>( 440 )’ 

^(440) = 0.015p 2 CW° 62 , 
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344 where the exponent of 0.62 was taken from Gordon and Morel [1983] , and 0.015 is the 

345 backscattering/total-scattering ratio [Sullivan and Twardowski, 2009]. 

346 For each Chi value (corresponding to an a P h( 440)), four parameters can be changed 

347 independently in modeling R rs (R), and Chi can be retrieved from the modeled R rs (k) with both 

348 OC4 and CIA (Eqs. 2-5) and compared with the input Chi to produce a relative error estimate. 

349 These four parameters include p\,p 2 , S, and r| . Below we show the results of three scenarios. 

350 Scenario 1. Both a& g and bb P vary independently from a p h( 440) 

351 c/ ; ,/,(440) was set to 0.0028, 0.008, 0.024, and 0.05 m" 1 , respectively, corresponding to Chi of 0.02, 

352 0.05, 0.3, and 1.0 mg m" 3 , respectively [Bricaud et al., 1995]. The minimum a,,/, (440) (0.0028 m" 1 ) 

353 is only half of the minimum a P h( 440) in the IOCCG dataset, and approximates the a P h( 440) 

354 values in the South Pacific Gyre [Morel et al., 2007b, Lee et al 2010] . For each <^(440) (and its 

355 corresponding Chl),pi varied from 0.4 to 2.0 with a step of 0.1 (17 pi values); p 2 varied from 0.1 

356 to 0.6 with a step of 0.1 (6 p 2 values); S varied from 0.013 to 0.019 with a step of 0.002 (4 S 

357 values); and r| was set to 0.5 and 1 .5. Thus, for each a^(440) (Chi), there are 816 sets of a&bb, 

358 816 R rs spectra, and 816 retrieved Chi values. The histogram of the relative errors of the 816 

359 retrieved Chi values from each algorithm is shown in Fig. 6. 

360 Except for the “high” concentration case (Chi = 1 .0 mg m" 3 ), the performance of the two 

361 algorithms is similar. Most results showed relative errors to within ±50%. The CIA appears to 

362 yield less data points for errors > 60%, and appears to have a better performance (narrower 

363 histogram) for Chi = 0.3 mg m" 3 . 

364 Scenario 2. ajg varies independently from a P h( 440), but b\, v covaries with a P h( 440) 

365 For each a P h(440) , p 2 was set to 0.45, r| = 1 .0, S = 0.016, but p\ was changed from 0.3 to 2.5 with 

366 a step 0.1 (23 p\ values). Fig. 7a shows that the relative errors in the retrieved Chi from both 

367 algorithms change from negative to positive with increasing a^la P b ratios, an expected result 

368 where the increased CDOM/detrital particles were mistakenly regarded as Chi because they all 

369 strongly absorb the blue light. For extremely low concentrations (Chi < 0.1), errors from the CIA 

370 are slightly higher, but for higher concentrations the errors approach those from the OC4 

371 algorithm. For the extreme case of Chi = 1 .0 mg m~ 3 , errors from the CIA are lower than those 

372 from the OC4 algorithm, especially when adg(440)/a p /j(440) is > 2.0 or < 1 .0. In general, for Chi 
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373 < 0.25 mg m' 3 and the moderate range of ad g (440)/a p fe(440) (1 .0 - 2.0), the retrieval errors from 

374 the two algorithms are similar. 

375 Scenario 3. b\, v varies independently from a P h( 440), but a^ covaries with « p *(440) 

376 For each a P h(440), p\ was set to 1 .0, r| = 1 .0, S = 0.016, but p 2 was changed from 0.1 to 0.6 with a 

377 step of 0.05 (21 p 2 values). Fig. 7b shows that for Chi < 0.3 mg m' 3 , the CIA yielded much lower 

378 relative errors for all cases regardless of the error sign. The errors from the CIA change sign 

379 between 0.1 and 0.3 mg m~ 3 . At Chi = 0.4 mg m~ 3 , the CIA errors approach those from the OC4. 

380 At Chi = 1 .0 mg m' , errors from the CIA are significantly higher than those from the OC4. 

_o 

381 These results suggest that for Chi <0.4 mg m' , the performance of the CIA is often significantly 

382 better than the OC4v6 when b\-, p varies independently from ^(440) (or Chi). 

383 The results above are based on simulated datasets, some of which may not be realistic in nature. 

384 Indeed, on large regional scales, the absorption OACs often covary [Morel, 2009] , although their 

385 relative proportions in modulating the R rs (k ) may change from one ocean basin to another. For 

386 example, although the ratio of ad g (440)/a p /,(440) showed a weak seasonality in an oligotrophic 

387 ocean site and there was an observable temporal lag between the two absorption terms, they did 

388 show high correlations in the temporal patterns [Hu et al., 2006] . In contrast to absorption OACs, 

389 bb P relative to Chi may vary substantially in both space and time [Loisel et al., 2002; Dierssen, 

390 2010] , where the CIA should perform significantly better than the OC4 algorithm in the 

391 algorithm tolerance to the independent bb P changes for low concentration waters. 

392 Overall, from these model-based simulations, we believe that the CIA should perform at least 

393 equivalently to the OC4 algorithm for Chi <0.4 mg m" 3 , if not significantly better. These results 

394 are also consistent to those shown in Fig. 3 when in situ data (assumed error free, but they 

395 certainly contained both measurement and data reduction errors) were used to evaluate algorithm 

396 performance, and consistent with those shown in Fig. 5. 

397 

398 63. Sensitivity to digitization-noise and atmospheric correction errors 

399 The above simulations are based on the assumption that the inputs of the algorithms, namely the 

400 R,s03 data, are error free. In practice, R rs (k) derived from satellite measurements may contain 

401 various errors from imperfect radiometric calibration, instrument noise and digitization round-off 
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402 noise, imperfect atmospheric correction, residual errors from whitecap and sun glint corrections, 

403 and stray light contaminations (Eq. 1 and Fig. 1). 

404 Assuming an error-free calibration and an error-free atmospheric correction scheme, Hu et al. 

405 [2001] used model simulations to evaluate the SeaWiFS data product uncertainties originating 

406 from instrument and digitization noise alone. They found that 1) errors in the retrieved /? rv (A) and 

407 band-ratio Chi were primarily from noise-induced perturbations in the atmospheric correction, 

408 which were propagated and enlarged from the near-IR bands to the visible bands, and 2) relative 

409 errors in the band-ratio Chi were more prominent in both low (< 0.1 mg m ) and high (> 10 mg 

410 m ) Chi ranges than in the intermediate Chi ranges. 

411 The same simulations were applied here to compare relative errors in Chloc4 and Chlci due to 

412 digitization/noise. Briefly, random noise at the level between -0.56 (a) and 0.56(A) was added to 

413 p,(A) in Eq. (1), where 6(A) is the spectral remote sensing reflectance corresponding to 1 digital 

414 count in the individual band: 

415 p, (A) = p,(A) + noise. (16) 

416 p,(A) and p, (A) were fed to the identical atmospheric correction and bio-optical inversion 

417 algorithms under various observation conditions (aerosol type and optical thickness, 

418 solar/viewing geometry), and the derived Chi from the noise-free p,(A) and noise- added p, (A) 

419 were compared and relative error assessed. Figs. 8 and 9 show examples of the simulation results. 

420 For 10,000 model runs of the given aerosol information (maritime aerosol with relative humidity 

421 of 90%) and solar/viewing geometry (scene center, solar zenith angle 0 O = 60°), the errors in the 

422 retrieved R rs ( A) due to digitization/noise alone are presented in Fig. 8. To first order, the errors 

423 are spectrally linear (Fig. 8a), and errors at 443 nm are roughly twice those at 555 nm (Fig. 8b). 

424 Because of the approximate linearity, most of these errors were cancelled in Eq. 3, resulting in 

425 much smaller errors in the Cl (Fig. 8c). In contrast, these same R rs ( A) errors can only be 

426 cancelled to a lesser degree in the band ratio R (Eq. 2), especially when the ratio is significantly 

427 different from 2 (when the ratio is ~2, adding twice as much error in the numerator as in the 

428 denominator will make the ratio unchanged). For the oligotrophic oceans, R rs ( 555) is small (the 

429 blue/green ratio R may reach > 6.0 - 8.0), then large errors in the blue/green ratio could be 

430 resulted when R rs (k) contains small, spectrally linear perturbations. Thus, the different sensitivity 
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431 of R and Cl to the digitization/noise induced errors leads to different accuracy in the retrieved 

432 Chi (Fig. 9). For the Chi range considered here, while the relative errors in Chloc 4 increased 

433 sharply with decreasing Chi, the errors in Chlci remained unchanged at a much lower level. 

434 Simulation results for other aerosol and solar/viewing geometry were different from those shown 

435 in Figs. 8 and 9, but the general pattern remained the same, i.e., relative errors in Chloc 4 were 

436 always higher than in Chlci for Chi <0.4 mg mg' 3 , with only the former depending on Chi. 

437 Clearly, for Chi <0.4 mg m' 3 , Chlci is much less sensitive than Chl 0 c 4 to digitization/noise 

438 induced errors for SeaWiFS. In practice, the atmospheric correction scheme implemented in 

439 SeaDAS has inherent errors to within ±0.002 in reflectance at 443 nm, which is the basis for the 

440 5% fidelity in the retrieved reflectance at 443 nm for clear waters [Gordon and Wang, 1994a; 

441 Gordon, 1997] . The ±0.002 reflectance errors are equivalent to R rs ( 443) errors of ~ ±0.002/71 = 

442 ±0.0006 sr' 1 , corresponding to R rs ( 555) errors of about ±0.0003 sr' 1 . These additional errors are 

443 comparable to those due to SeaWiFS digitization/noise (Figs. 8a & 8b) and are independent of 

444 instrument sensitivity (i.e., they apply to all ocean color sensors including SeaWiFS and 

445 MODIS/Aqua). While the digitization/noise induced errors, assumed randomly distributed, may 

446 be averaged out if sufficient number of points (image pixels) are available, the atmospheric 

447 correction errors may create a bias at various spatial and temporal scales because the conditions 

448 to result in these atmospheric correction errors may not be random (yet the spatial/temporal 

449 distributions of these conditions is unknown). This effect will be shown below with satellite data 

450 analysis. 

451 

452 7. Evaluation using SeaWiFS and MODIS/Aqua imagery 

453 The CIA was applied to SeaWiFS Level-2 GAC data to derive Chlci, and compared with the 

454 default Chloc 4 - In the comparison, the following quality control flags were used to discard all 

455 suspicious data points: atmospheric correction failure (bit 1), land (bit 2), high sun glint (bit 4), 

456 total radiance greater than knee (bit 5), large satellite zenith (bit 6), stray light (bit 9), cloud/ice 

457 (bit 10), coccolithophores (bit 1 1), large solar zenith (bit 13), low water-leaving radiance (bit 15), 

458 chlorophyll algorithm failure (bit 16), questionable navigation (bit 17), near-IR exceeds 

459 maximum iteration (bit 20), chlorophyll warning (bit 22), and atmospheric correction warning 

460 (bit 23). These are the same flags as used to perform data quality control during SeaWiFS and 
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461 MODIS Level-3 data binning. Fig. 1 shows the images of Chl 0 c 4 , Chl C i, t_865, and R rs ( 555) for 

462 the North Atlantic Ocean gyre from an arbitrarily selected date. 

463 The image speckling effect is apparent in the Chloc 4 image (Fig. la), where discontinuity and 

464 patchiness can also be found. While the speckling effect (pixelization noise) is due primarily to 

465 digitization/noise induced errors, the patchiness is more likely due to atmospheric correction 

466 errors and other correction errors (such as whitecap correction). Indeed, similar discontinuity and 

467 patchiness are also found in the x_865 and R rs ( 555) images (Figs, lc and Id). Such sharp 

468 changes and patchiness in both the atmosphere and ocean properties in an ocean gyre are 

469 unlikely to be realistic, but can only be due to algorithm errors. These errors occasionally led to 

470 R rs ( 555) values less than the theoretical limit for even the clearest ocean waters, 0.001 sr' 1 . In 

47 1 contrast to the Chl 0 c 4 image that contains speckle noise and patchiness, the Chl C i image in Fig. 

472 lb, derived from identical R rs (k ) data as used to derive Chl 0 c 4 . shows much smoother and more 

473 spatially coherent distributions even near cloud edges. These results strongly suggest that Chlci is 

474 much more immune to both digitization/noise and atmospheric correction errors, consistent with 

475 those found from the simulations (Figs. 8 & 9). Note that some of the noises are due to straylight 

476 contamination near clouds, but most of these noises are effectively removed by the CIA, 

477 suggesting that these noises are also spectrally linear. 

478 To quantify the image speckling noise from the satellite images, a 3x3 median filter was used to 

479 smooth the Chi images, with the result assumed as the “truth.” The relative difference between 

480 the original data and the smoothed data was assumed to be primarily from digitization/noise 

481 induced errors. To avoid potential assessment bias due to insufficient sample size, all valid 

482 SeaWiFS Level-2 pixels for the 20° x 20° box in the North Atlantic gyre from the 599 images in 

483 1998 were queried, and RMS error for each predefined Chi interval was calculated. Fig. 10a 

484 shows that the RMS errors in Chloc 4 increase sharply with decreasing Chi while these errors in 

485 Chlci remain stable at a much lower level. The overall patterns agree very well with those from 

486 the model simulations (Fig. 9), suggesting that most of these speckling errors originate from 

487 digitization/noise (through error propagation in the atmospheric correction). The discrepancy in 

488 the error magnitude between Fig. 9 and Fig. 10a originated from the different scenarios: Fig. 9 is 

489 for a single observing condition based on simulations while Fig. 10a accounts for all observing 

490 conditions for the entire year. Another reason may be due to stray light and imperfect sun glint 
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491 and whitecap corrections, which were not accounted for in the simulations. Indeed, the SeaWiFS 

492 GAC data were collected by resampling the 1-km data every 4 th row and column, and the 

493 potential small clouds between the resampled pixels may lead to stray light contamination to the 

494 “valid” pixels. These potential stray light problems for SeaWiFS GAC data cannot be assessed 

495 from the data alone because of the data gap (i.e., the resampled “1km” pixels in the GAC data 

496 are 3 -km away from each other). Yet, Figs. 10a and 1 show that under realistic measurement 

497 conditions the relative RMS errors in Chlci is significantly smaller than in Chloc 4 for low 

498 concentrations. This finding holds true even when the SeaWiFS LAC data at 1-km resolution are 

499 used for the same comparison. 

500 The statistics in Fig. 10a also suggest the improvement of the Cl algorithm in reducing the 

a 

501 number of “extreme” data points from the OC4 algorithm (e.g., Chi < 0.02 mg m' ). These 

502 “extreme” points are not only due to digitization-induced errors, but also due to atmospheric 

503 correction errors and/or other algorithm artifacts (whitecap and sun glint corrections, stray light 

504 contamination). Indeed, the changes in the number of valid pixels for each Chi interval from 

505 Chl 0 c 4 to Chlci suggest data redistribution, which will affect time-series analysis over low- 

506 concentration waters. 

507 SeaWiFS data for the North Atlantic and South Pacific Gyres for an entire year were visualized 

508 to examine whether the above observations could be generalized. The results confirmed those 

509 shown in Fig. 1 , and suggest that most digitization-noise related specking errors can be removed 

510 using the CIA for low concentrations, and many other algorithm artifacts (sun glint and whitecap 

511 corrections, atmospheric correction, and stray light contamination) can also be reduced with the 

512 CIA. The effect of such correction on time-series analysis is demonstrated below. 

513 

514 8. Comparison between Chloc 4 and Chid time-series 

515 Fig. 1 1 shows a one-year time-series at an oligotrophic site in the North Atlantic Gyre using 

516 SeaWiFS daily Level-2 GAC data. While the Chl 0 c 4 data show high speckling (high standard 

517 deviations at each 3x3 point) and nearly no seasonality due to other errors, the Chid data show 

518 much cleaner time series and also a clear seasonality. Note that the standard deviation at each 

519 point represents digitization/noise induced errors, but the deviation of the 3x3 mean data value 

520 from the seasonal pattern represents errors from other sources, which are effectively removed in 

521 the Chid time series. This effect also remains for the monthly composite time series at the same 
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522 location (Fig. 12). The seasonality of Chlci is clear in every year of the 13-year time series (note 

523 that there were some missing data after 2005 due to instrument operations), but less apparent in 

524 the corresponding Chloc 4 time series. The mean monthly variance (standard deviation over mean) 

525 reduced from 26.6% in Chl 0 c 4 to 9.9% in Chlci. All these results suggest improvements of the 

526 CIA in constructing Chi time-series for oligotrophic waters. 

527 The improvement of Chlci in deriving a better time series is primarily because of reduction of 

528 algorithm-induced errors as opposed to the reduction in speckling noise. As shown in Figs. 1 1 

529 and 1 as well as in Hu et al. [2001] , while the image speckling noise can be removed using pixel 

530 averaging (either 3x3 or temporal averaging), algorithm-induced errors cannot be removed this 

531 way and will ultimately propagate to higher-level data products in global or regional time-series 

532 analyses. The significantly reduced errors in the Chlci data product may result in more consistent 

533 spatial and temporal patterns than the current OC4 algorithm for the oligotrophic oceans. 

534 

535 

536 9. Discussion 

537 9.1. Algorithm accuracy: band ratio or band difference? 

538 The comprehensive analyses above, from direct validation, theoretical background, sensitivity 

539 analysis through bio-optical and atmospheric correction simulations, to satellite data product 

540 comparison, all suggest that the CIA is more robust than the OC4 algorithm for low 

541 concentrations (Chi < 0.25 mg m" 3 ). This range corresponds to about 77% of the global ocean 

542 area, suggesting potentially profound effects in global- and regional-scale studies. In particular, 

543 studies focusing on ocean gyre variability [McClain et al., 2004 et al., 2004; Polovina et al., 2004] 

544 and second-order ocean chlorophyll variability [Brown et al., 2008] may need to be revisited 

545 with the new algorithm. 

546 The improved performance of the CIA is primarily due to two reasons. First, for most cases 

547 considered, it appears equivalent or even more tolerant (i.e. less sensitive) than the OC4 

548 algorithm to in-water perturbations when the various OACs (especially particle backscattering) 

549 do not covary. Although the non-covariance of the OACs may represent a primary reason why a 

550 “global” algorithm may not work for a particular region [Claustre and Maritorena, 2003; 

55 1 Dierssen, 2010] , it is not the objective of any empirical algorithm to solve this global “puzzle.” 

552 Likewise, the chlorophyll- specific absorption coefficient (i.e., absorption per Chi) may also vary 
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553 substantially due to different pigment composition and phytoplankton size, but all “global” 

554 empirical algorithms would suffer the same from this variability. At the least, the CIA is 

555 equivalent or slightly better for most oligotrophic waters than the OC4 algorithm to the in-water 

556 perturbations. The improved performance over backscattering perturbations is of particular 

557 importance, as this may lead to an improved Chi retrieval in scattering-rich low-concentration 

558 waters due to bubbles or other marine organisms such as coccolithophores. Second and most 

559 importantly, the CIA can partially remove most algorithm artifacts induced by digitization-noise 

560 errors, atmospheric correction errors, residual errors due to imperfect sun glint and whitecap 

561 corrections, and some of the stray light contamination. Although the band-ratio OC4 algorithm 

562 can also remove some of these errors to a certain degree, the removal is much less effective for 

563 low-concentration waters. 

564 Indeed, the concept to use alternative ways instead of band-ratio algorithms to derive Chi is not 

565 new. Campbell and Esaias [1983] proved why a curvature algorithm in the form of Sj 2 /(SiSk) 

566 could be used to derive chlorophyll concentrations. Here Sj represents the measured signal in one 

567 band (calibrated or not) and S; and Sk represent the signals from the two neighboring bands. 

568 Barnard et al. [1999] showed the validity of a similar curvature approach to derive absorption 

569 coefficients. Lee and Carder [2000] further used simulations to compare band-ratio and band- 

570 curvature algorithm performance, and highlighted that band-ratio algorithms were more sensitive 

571 to a wider dynamic range . 

572 Early pioneer efforts for algorithm development also proposed band-difference algorithms 

573 [Viollier et al., 1978; Viollier et al., 1980; Tassan, 1981], where the difference between two 

574 neighboring blue and green bands was related to surface Chi. The rationale for choosing a blue- 

575 green band difference was because of its tolerance to various errors in the spectral reflectance, 

576 including whitecaps [Tassan, 1981]. However, through model estimates, Gordon and Morel 

577 [1983] argued that because reflectance is in principle proportional to backscattering to the first 

578 order (i.e., R rs oc bja, see Eq. 7), a band-difference algorithm will retain most variability of bb 

579 relative to phytoplankton, thus subject to large errors if bb varies independently from 

580 phytoplankton (e.g., sediment-rich coastal waters). In contrast, as long as the spectral variability 

581 of bb is within a narrow range, a band-ratio algorithm will overcome such variability to first 

582 order, making the algorithm less sensitive to independent bb changes. For this reason, except for 
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583 a handful of studies in the 1980s, band difference algorithms have rarely been used in the 

584 published literature. One exception was perhaps the normalized difference pigment index (NDPI) 

585 algorithm proposed by Frouin [1997] for the POLarization and Directionality of the Earth’s 

586 Reflectances (POLDER) instrument [Mukai et al., 2000] , which combined the band-difference 

587 and band-ratio forms using the 443, 490, and 555-nm bands. The NDPI algorithm is essentially a 

588 band-ratio algorithm, although the 443-555 difference in the numerator has been shown to 

589 remove some noise. A similar combination of band-difference and band-ratio was proposed for 

590 the recently launched Geostationary Ocean Color Imager (GOCI), yet its performance over 

591 oligotrophic waters needs to be validated. 

592 The fundamental principles and model simulation results in Sections 6.1 and 6.2 suggest that the 

593 arguments in Gordon and Morel [1983] on the weakness of band-difference algorithms should be 

594 revisited for oligotrophic oceans. Indeed, for Chi <0.4 mg m' , the simulation results showed 

595 that a 3-band difference algorithm (i.e., the CIA) is more tolerant to independent /? b changes than 

596 the band-ratio algorithm. This may appear against intuition for the reasons outlined in Gordon 

597 and Morel [1983] . However, Eq. (6) shows that R rs ( X) is not proportional to particulate 

598 backscattering (Z? bp ), but influenced by both molecular and particle backscattering (£> bw ) and Z? bp . 

599 When Chi is low, the proportion of bb p to total bb is relatively small (e.g., 6^(440) ~35% of total 

600 6/, (440) for Chi = 0.1 mg m~ 3 , and the other 65% is due to a constant water molecular scattering), 

601 resulting in the tolerance of the CIA to independent & bp changes. In addition, the design of Cl 

602 (Eq.3) places more relative weighting of bb w than for bt, p for low concentrations. For high Chi 

603 waters (e.g., Chi = 1 .0 mg m' 3 , Fig. 7b), 6 bp dominates bb, and the CIA becomes more sensitive 

604 than the OC4 algorithm to independent bb P changes, consistent with the arguments of Gordon 

605 and Morel [1983] . For the tolerance to other errors (sensor noise, atmospheric correction residual 

606 errors, sun glint and whitecap correction residual errors, stray light contamination, etc.), the CIA 

607 is better than the band-ratio algorithm, confirming Tassan’s argument. The CIA, however, is not 

608 a simple blue- green difference, but takes a third band in the red to account for the various errors 

609 listed above. 

610 The stability of empirical Chi algorithms to independent bb P changes is particularly important to 

611 reduce Chi errors or inconsistencies either in one ocean basin or across multiple basins. Dierssen 

612 [2010] showed that for low Chi values (< 0.2-0.4 mg m' 3 ), 6 bp (532) may increase by several folds 
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613 from the North Atlantic to the California coastal waters for the same Chi, and Z>b P (532) in the 

614 same ocean basin may also remain relatively stable when Chi varied substantially. Similarly, 

615 Loisel et al. [2002] showed seasonal shifts of Z?b P (490)/Chl from SeaWiFS monthly data for both 

_2 

616 North Atlantic and North Pacific, with their relative ratios varying between ~0.6 and ~1 .7 (x 10' 

617 (m 1 / mg m" 3 )), a change of about 3 folds. Fig. 7b suggests that for a 3-fold change between 

618 0.175 and 0.525 on the x-axis, relative errors in Chla are mostly within ±10% for Chi < 0.3 mg 

619 m' 3 , while the relative errors in Chloc 4 nearly doubled. Thus, the CIA can reduce backscattering 

620 induced errors in the Chi retrieval for oligotrophic waters. 

621 Although the accuracy of the CIA appears to be higher than the OC4 algorithm for SeaWiFS (Fig. 

622 4), it is indeed difficult to evaluate the absolute algorithm accuracy for low concentrations. This 

623 is primarily due to the lack of sufficient high-quality in situ data. The entire SeaBASS archive is 

624 restricted to Chi >0.02 mg m' , and only a limited number of stations had Chi between 0.02 and 

_o 

625 0.05 mg m' . Laboratory measurement errors in determining Chi from seawater samples, using 

626 either fluorometric or HPLC methods, can be 50% [Trees, et al., 1985; Kumari, 2005] . The 

627 errors in these ground “truth” data further weaken the statistical robustness of the validation 

628 results when only several points are available. Future efforts may emphasize the oligotrophic 

629 ocean gyres to collect more in situ data in this range. Because most commercial instruments have 

630 a precision of about 0.01 mg m' 3 , accurate laboratory measurement for this range is extremely 

631 difficult. While new sensors may be developed to increase the precision and accuracy, our 

632 current emphasis is on data consistency across various spatial and temporal scales, for which the 

633 CIA appears to yield better performance than the band-ratio algorithms. 

634 Despite such improved performance in the CIA, all potential artifacts or uncertainties for 

635 empirical algorithms, as discussed and demonstrated in the refereed literature [IOCCG, 2000 & 

636 2006; Dierssen, 2010] , still exist (although to a less degree than band-ratio algorithms, as shown 

637 in the algorithm sensitivity to bb P variability). Both Cl and band-ratio provide a measure of the 

638 spectral change of R rs (either difference or ratio). While most of such changes could be related to 

639 phytoplankton (i.e., Chi), they could also be modulated by changes in CDOM or other OACs. In 

640 addition, all these empirical algorithms assume, implicitly, a stable covariation of the 

641 chlorophyll- specific absorption coefficient with Chi. The ultimate way to improve Chi retrievals 

642 in the global oceans may still be to account for all these variability explicitly through semi- 
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643 analytical inversions, but this is out of the scope of the present work. The semi-analytical 

644 algorithms, at least in their present forms, however, are not immune to the problems shown in 

645 Fig. Id where R IS data (input of the algorithms) contain substantial noise and errors. These errors 

646 must be corrected in order to improve the performance of semi-analytical algorithms. Likewise, 

647 algorithms for many other ocean color products (e.g. IOPs, particulate organic carbon or POC, 

648 particulate inorganic carbon or PIC) rely heavily on accurate R rs (k), whose performance may 

649 also be improved once the errors in the satellite-derived R rs (k) are reduced. 

650 All above analysis were restricted to SeaWiFS GAC data. However, application of the same CIA 

651 algorithm to SeaWiFS LAC data showed similar improvements over image quality. Fig. 13 

652 shows an example of the comparison of Chl 0 c 4 and Chloci using SeaWiFS Level-2 LAC data. 

653 Clearly, all instrument/algorithm artifacts shown in the GAC data (Fig. 1) also exist in the LAC 

654 data (to a lesser degree), but these artifacts can be effectively removed by the CIA algorithm. 

655 9.2. Applications to other ocean color instruments 

656 The improved performance in the CIA for low concentrations appears to be universal across 

657 sensors, although the regression coefficients may need to be adjusted to account for sensor 

658 specifics. Figs. 14-16 show several examples from other ocean color instruments, from 

659 MODIS/Aqua, MERIS , and CZCS , respectively, where improvement in image quality in terms 

660 of reduced noise/errors and image sharpness is apparent. 

661 Similar to the SeaWiFS speckling error analysis shown in Fig. 10a, the same CIA was 

662 implemented to process all MODIS/Aqua Level-2 data for the 20° x 20° box in the South Pacific 

663 Gyre (745 images in 2002). Fig. 10b shows that, although the speckling errors are reduced for 

664 MODIS Chl 0 c 3 relative to SeaWiFS Chl 0 c 4 (MODIS/Aqua instrument signal-to-noise is about 2- 

665 3 times higher than SeaWiFS), the general pattern remains the same, i.e., increased specking 

666 errors with decreasing concentrations. MODIS Chloci, in contrast, shows relatively stable and 

667 much lower specking errors. Nearly all data points in Chloc 3 <0.01 mg m' have been raised in 

668 Chloci, and this is likely to be real, as shown in the example in Fig. 13. 

669 Fig. 13 shows that MODIS/Aqua Chloc 3 data are not immune to noise and algorithm errors even 

670 after all suspicious data (associated with the various quality control flags) are discarded. In 

671 contrast, the CIA successfully “corrected” these suspicious data to reasonable levels, as gauged 
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672 from nearby pixels and adjacent images. This result explains the histogram shift between Chloc 3 

673 and Chlci for extremely low values in Fig. 10b. Furthermore, even when all the quality-control 

674 flags are turned off (i.e., all low-quality non-zero data are used), the CIA appears to perform well 

675 on all those flagged pixels (Figs. 13c&d), indicating that the R rs (k) errors from those pixels are 

676 spectrally related so that the CIA could remove these errors, at least to the first order. This 

677 suggests that the CIA algorithm may also result in more spatial coverage, once appropriate flags 

678 are determined to relax the quality control criteria. 

679 Fig. 15 shows an example of how the CIA (same coefficients used for SeaWiFS) improves 

680 MERIS image quality when compared with the default band-ratio algorithm. The reduction of 

68 1 pixelization and striping noise is apparent in the Chloci image, with more coherent eddy features 

682 observed. More profound improvement has also been found for CZCS (Fig. 16). CZCS is an 8- 

683 bit instrument with much lower signal-to-noise ratio (about 3 times lower than SeaWiFS), and 

684 the band-ratio algorithm resulted in significant speckling noise and other errors (Fig. 16a), where 

685 no ocean feature can be observed. In contrast, most of these errors have been removed by the 

686 CIA, leading to clear eddy and circulation features in the North Atlantic oligotrophic ocean. 

687 Furthermore, the general gradient from west to east in Fig. 16a, a result of algorithm artifact, has 

688 been successfully removed in Fig. 16b. 

689 Although the absolute accuracy in the retrieved Chloci for other ocean color instruments has not 

690 been evaluated, we believe that once algorithm coefficients are tuned for the particular 

691 instruments or the satellite-derived R TS (R) are tuned to the SeaWiFS wavelengths, a significant 

692 improvement in product accuracy, in addition to image quality can be achieved. Such an 

693 improvement may lead to more consistent observations between different instruments. For 

694 example, after a slight adjustment to convert the MODIS/Aqua i? rs (547) to R rs ( 555) and 

695 application of the same CIA and coefficients (Eq. 4) to the global data for 2006, mean ratio 

696 between MODIS and SeaWiFS Chi over the global oligotrophic oceans shows much less 

697 seasonal variability and is closer to 1 .0 from the CIA than from the OCx algorithms (Fig. 17). 

698 Such an improvement is even more profound when data distributions rather than a global mean 

699 ratio are examined. Fig. 18 shows the data distribution for all “deep” waters (> 200 m) from the 

700 band ratio (OCx) and Cl algorithms using all SeaWiFS and MODIS/Aqua data collected during 

701 November 2006. Although there is a slight offset of 0.01-0.02 mg m' in the global mean and 
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702 median values between the two algorithm results (a and b, respectively), the CIA (after blending 

703 with the OCx for Chi > 0.25 mg m" 3 ) resulted in nearly identical histograms between SeaWiFS 

704 and MODIS/Aqua measurements, a significant improvement in data cross-sensor data 

705 consistency as compared from those obtained from the OCx algorithms. Analyses for other 

706 months of 2006 showed similar improvements. Although we are still performing an extensive 

707 evaluation of the new algorithm for the global ocean using all SeaWiFS and MODIS/Aqua data, 

708 the improved consistency between SeaWiFS and MODIS/Aqua measurements from these 

709 preliminary results is indeed encouraging, and may eventually lead to a better multi-sensor Chi 

710 climate data record for long-term studies of ocean biological changes (Antoine et al., 2005; 

711 Gregg et al . , 2005 ; Maritorena et al . , 20 1 0) . 

712 

713 93. Other applications 

714 Studies of the ocean’s biogeochemistry call for the highest accuracy in data products. For many 

715 other applications, such a strict requirement may often be relaxed. For example, tracking of oil 

716 pollution requires timely knowledge on major ocean circulation features including eddies [Hu, 

717 2011; Liu et al . , 20 1 1 ] . The various examples shown in Figs . 13-16 prove that the CIA can lead 

718 to significantly improved image quality for feature recognition when individual images are used. 

719 This is due to its ability to reduce noise and errors as well as to “recover” most of the flagged 

720 (i.e., suspicious) pixels. Some of the eddy features are completely absent in the Chlocx images 

721 due to noise and algorithm errors (i.e., regardless of the color stretch), but are vividly revealed in 

722 the Chloci images. This ability will greatly facilitate studies of eddy dynamics (e.g., Lehahn et al., 

723 2007; Rossby et al., submitted) in the oligotrophic oceans. 

724 

725 10. Conclusion 

726 A novel 3-band reflectance difference algorithm, namely a color index algorithm (CIA), to 

727 estimate surface chlorophyll-a concentrations from satellite ocean color measurements has been 

728 shown superior to the existing band-ratio algorithms in reducing uncertainties for Chi < 0.25 mg 

729 m" 3 , corresponding to about 77% of the global ocean. This was somehow a surprise, given the 

730 known artifacts of 2-band difference algorithms proposed three decades ago. We attribute the 

731 success of the CIA to the new design of adding a third band in the red to the blue-green bands. 
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732 This addition enables the CIA to relax the requirements of spectrally flat errors for the 2-band 

733 difference algorithms to spectrally linear errors for the CIA, and also increases the stability of 

734 algorithm performance over backscattering variability of the ocean. The improved performance 

735 of the CIA over the existing band-ratio algorithms has been demonstrated in all measures, from 

736 global validations using in situ data, atmospheric correction and bio-optical simulations, to 

737 satellite image analysis. The CIA also appears to improve data consistency between different 

738 instruments for oligotrophic oceans. We expect to implement the CIA for multi-sensor global 

739 processing for oligotrophic oceans to further test its robustness, which might lead to different and 

740 potentially improved spatial/temporal patterns of Chi in response to long-term climate changes 

741 and short-term climate variability. 
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934 Tables 

935 

936 

937 Table 1 . Chi algorithm performance for Cl < -0.0005 sr" 1 using the NOMAD dataset. OCi ow 

938 represents a local polynomial fit between the log-transformed band-ratio and Chi for low 

939 concentrations only (Cl < -0.0005, Fig. 3a red line), which shows improved performance than the 

940 globally tuned OC4v6. The regression equation is Chlocjow = lo." 039064- 1 54789x + 3 - 2125 * x * x ' 

94 1 3.io73*x*x*x (jpms j s “unbiased” RMS (see text for details). 


Algorithm 

RMS 

URMS 

Mean 

Ratio 
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Ratio 

R 2 

(linear) 
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(log) 

as 

OC4v6 

34.9% 

28.2% 

1.11 

1.08 

0.73 

0.85 


Cl 

16.5% 

16.2% 

1.01 

1.01 

0.78 

0.95 

50 

OCiow 

22.7% 

22.3% 

1.03 

1.05 

0.73 

0.85 
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948 

949 

950 

951 Table 2. Chi algorithm performance for Chi < 0.25 mg m' 3 , as gauged by in situ Chi (Fig. 4). 

952 SeaWiFS-derived R rs (k) were used as the input of all algorithms. OCi ow represents a local band- 

953 ratio algorithm for low concentrations only (Cl < -0.0005 sr" 1 , Fig. 3a red line). MRE is the mean 

954 relative error after converting negative errors to positive. URMS is “unbiased” RMS (see text for 

955 details). 

956 
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962 Figure captions 

963 

964 Fig. 1 . SeaWiFS Level-2 GAC data products at 4-km resolution on 20 February 1998 over the 

965 Sargasso Sea (about 1800 x 2640 km centered at 25.5°N 54.8°W). (a) Chi derived from the 

966 default OC4v6 algorithm (Chl 0 c 4 ); (b) Chi derived from a new color-index (Cl) based algorithm 

967 (Chla, see text for details); (c) Aerosol optical thickness at 865 nm (x_865, dimensionless); (d) 

968 Remote sensing reflectance at 555 nm (R rs (555), xlO 3 sr" 1 ). All suspicious data, as defined by the 

969 various Level-2 flags, have already been removed (black color). 

970 Fig. 2. Illustration of the Cl algorithm concept. When Chi increases from 0.02 to 0.33 mg m" 3 , 

971 R rs (443) decreases while R rs (555) and R rs (670) remain relatively stable. Thus, the distance from 

972 R rs (555) to the linear baseline between R rs (443) and R rs (670) (dotted line in the figure), defined as 

973 the color index (Cl), is highly corrected with Chi. This is the same principle as using the 

974 R rs (443)/ R rs (555) ratio to relate to Chi. These in situ data are from the NOMAD dataset. 

975 Fig. 3. Relationship between in situ chlorophyll-a concentration (Chi) and (a) reflectance ratio R 

976 and (b) color index (Cl). The highlighted points emphasize those corresponding to Cl < -0.0005, 

977 where the corresponding data collection locations are shown in the inset map. Note that the 

a 

978 minimum Chi in this dataset is about 0.02 mg m' . In (a), the RMS error is estimated between 

979 measured and OC4v6 predicted Chi. If a best fit from all data points for Cl < -0.0005 sr" 1 is used, 

980 RMS error is reduced to 22.95%. Statistics are presented in Table 1 . 

981 Fig. 4. Comparison between in situ Chi and satellite-based Chi for SeaWiFS. The satellite Chi 

982 was derived from both the OC4v6 algorithm (empty circles) and OCI algorithm (dots). Note that 

983 for Chi > 0.4 mg m" 3 the results from the two algorithms were forced to be identical (Eq. 5). The 

984 locations of the in situ measurements for Chi < 0.25 mg m' 3 are shown in the corresponding map. 

985 The comparison statistics for low concentration (Chi < 0.25) are listed in Table 2. 

986 Fig. 5. Relationship between the two backscattering terms in Eq. (9) with Chi. To show their 

987 relative magnitudes, the absolute values (x 1000) are shown here. Note that for Chi < 0.4 mg m" 3 , 

988 the water term dominates the numerator of Eq. (9). 

989 Fig. 6. Chi algorithm sensitivity to independent changes of detrital particles and CDOM relative 

990 to phytoplankton, based on 816 model simulations for each Chi value (Eq. 6, 13-15). 
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991 Fig. 7. Chi algorithm sensitivity to independent changes of absorption of detrital particles and 

992 CDOM (flag) relative to Chi (a), and to independent changes of particular backscattering (b bp ) 

993 relative to Chi (b), based on model simulations for each Chi value (Eq. 6, 13-15). Note that in (b), 

994 the added simulation was for Chi = 0.4 (star symbols), when the errors in the Cl retrievals are 

995 shown to approach those of the OC4 retrievals. 

996 Fig. 8. Errors in R rs (k ) and Cl induced by SeaWiFS digitization-noise after applying the Gordon 

997 and Wang (1994a) atmospheric correction. Most of the errors are due to the impact of the small 

998 noise on the atmospheric correction bands in the near infrared, which extrapolate the atmospheric 

999 properties to the visible (Hu et al., 2001). These errors are approximately linear to changing 

1000 wavelengths (a and b), and can thus be corrected to first order by the Cl algorithm (Eq. 3, Fig. 2), 

1001 resulting in smaller errors in Cl (and Chlci, see Fig. 9). The model parameters are listed in (c). 

1002 Results from other modeling scenarios are different, but the principles in reducing the noise- 

1003 reduced errors using the Cl are the same. 

1004 Fig. 9. Error distribution in the retrieved Chi due to digitization-noise induced R rs (R) errors for a 

1005 clear maritime atmosphere (Fig. 8). In situ R rs data for the input Chi concentrations (from 0.02 to 

1006 0.4 mg m' ) were combined with the R rs (k) errors to estimate Chi, where the “true” Chi was 

1007 determined from the input R IS data free of errors. The differences were used to determine the 

1008 relative retrieval errors. Note that the Cl-based retrieval errors are independent of Chi 

1009 concentrations. 

1010 Fig. 10. (a) Statistics of speckling error in SeaWiFS GAC images in 1998 (n=599) for a 20 x 20° 

1011 region in the Sargasso Sea. The speckling error is defined as the relative difference between the 

1012 original Level-2 Chi and a 3x3 median-filter smoothed Level-2 Chi, with the assumption that 

1013 most noise-induced speckling errors are removed in the latter. Note that while the RMS errors in 

1014 Chl 0 c 4 increase sharply with decreasing concentrations, RMS errors in Chl C i remain stable at a 

1015 much lower level in the entire concentration range here. The overall patterns agree well with 

1016 those from the model simulations (Fig. 9), suggesting that most of these speckling errors 

1017 originate from digitization/noise (through atmospheric correction). The total number of valid 

1018 pixels from each algorithm indicates that all Chloc 4 ^ 0.02 mg m" 3 appear unrealistic due to 

1019 primarily atmospheric correction artifacts, (b) Same as in (a), but data were extracted from 

1020 MODIS/Aqua Level-2 images in 2002 (n=745) for a 20 x 20° subregion in the Southern Pacific. 
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1021 Fig. 1 1 . Chi (mg m' 3 ) time series derived from SeaWiFS GAC R rs (R) data using the OC4v6 

1022 algorithm (top) and the Cl algorithm (bottom). Data were extracted from 3x3 pixels centered at 

1023 24.5°N 55°W from the daily measurements. For any given image (date), only when more than 

1024 half of the pixels (in this case, >5 pixels) contained valid data (i.e., not associated with any 

1025 suspicious flags) were statistics estimated. 

1026 Fig. 12. Chi (mg m~ 3 ) time series derived from SeaWiFS GAC R rs (R) data using the OC4v6 

1027 algorithm (top) and the Cl algorithm (bottom). Data were first extracted from 3x3 pixels centered 

1028 at 24.5°N 55°W from the daily measurements. For any given image (date), only when more than 

1029 half of the pixels (in this case, >5 pixels) contained valid data (i.e., not associated with any 

1030 suspicious flags) were statistics estimated. The daily data were then averaged for the calendar 

1031 month to construct the monthly time series. Note that SeaWiFS was not continuously operational 

1032 after 2005 due to instrument operations. 

1033 Fig. 13. Comparison between SeaWiFS Level-2 Chl 0 c 4 (a) and Chloci (b) over the western North 

1034 Atlantic Ocean. SeaWiFS data were collected on 1 June 2004 (17:15 GMT) and processed with 

1035 SeaDAS6.1 . The Level-2 quality-control flags were turned off to show the circulation features. 

1036 Note that some eddy features are clearly revealed in the Chloci image but absent in the Chloc 4 

1037 image due to noise and residual errors in atmospheric correction and other corrections. 

1038 Fig. 14. MODIS/Aqua Level 2 Chl 0 c 3 and Chloci derived from a subregion in the South Pacific 

1039 Gyre (about 2200 x 440 km centered at 25.2°S 110.8°W) on 4 March 2003, 21:10 GMT. (a) and 

1040 (c) show the default Chloc 3 when the quality control flags are on and off, respectively, (b) and (d) 

1041 are the corresponding Chloci images . 

1042 Fig. 15. Comparison between MERIS full-resolution (FR) Chl 0 c 3 (a) and Chloci (b) over the 

1043 western North Atlantic Ocean. MERIS data were collected on 7 May 201 1 (15:21 GMT) and 

1044 processed with SeaDAS6.1 . Note that most speckling and vertical striping noise in the Chloc 3 

1045 image has been removed in the Chloci image, where several eddy and circulation features can be 

1046 better observed. Further, although the same algorithm coefficients for SeaWiFS were used, 

1047 Chloci values in offshore water appear to be closer than Chloc 3 to those from SeaWiFS for the 

1048 same region during similar periods (Fig. 13). 
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1049 Fig. 16. Comparison between CZCS Level-2 Chloc 2 (a) and Chloci (b) over the western North 

1050 Atlantic Ocean (about 30° - 36°N, 70° - 60°W). CZCS data were collected on 3 1 July 1983 

1051 (16:02 GMT) and processed with SeaDAS6.1 . Note that all eddy and circulation features in the 

1052 Chloci image are completely absent in the Chl 0 c 2 image. 

1053 Fig. 17. Mean Chi ratio over global oligotrophic oceans between MODIS/Aqua and SeaWiFS 

1054 estimates using the OCx (blue) and Cl (black) algorithms. Here “oligotrophic” is defined as all 

_o 

1055 9-km pixels with SeaWiFS mission mean Chi <0.1 mg m' . 

1056 

1057 Fig. 18. Chi distribution in the global deep oceans (> 200 m) during November 2006, as derived 

1058 from SeaWiFS (black) and MODIS/Aqua (red) measurements. Results in (a) are from the OCx 

1059 band-ratio algorithms, and in (b) are from the Cl algorithm (blended with the OCx algorithms for 

1060 Chi > 0.25 mg m" 3 ). Note the offset of 0.01 - 0.02 mg m" 3 in the global mean and median values 

1061 between (a) and (b). Results from other months of 2006 show similar improvements in histogram 

1062 consistency. 
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Fig. 1 . SeaWiFS Level-2 GAC data products at 4-km resolution on 20 February 1998 over the Sargasso Sea (about 
1800 x 2640 km centered at 25.5°N 54.8°W). (a) Chi derived from the default OC4v6 algorithm (Chl 0C 4 ); (b) Chi 
derived from a new color-index (Cl) based algorithm (Chl C i, see text for details); (c) Aerosol optical thickness at 865 
nm (x_865, dimensionless); (d) Remote sensing reflectance at 555 nm (R rs ( 555), xlO 3 sr' 1 ). All suspicious data, as 
defined by the various Level-2 flags, have already been removed (black color). 
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Fig. 2. Illustration of the Cl algorithm concept. When Chi increases from 0.02 to 0.33 mg m" 3 , 
i? rs (443) decreases while i? rs (555) and i? rs (670) remain relatively stable. Thus, the distance from 
R IS (555) to the linear baseline between /? rs (443) and R r s (670) (dotted line in the figure), defined as 
the color index (Cl), is highly corrected with Chi. This is the same principle as using the 
/?rs(443)/ RJ555) ratio to relate to Chi. These in situ data are from the NOMAD dataset. 
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Fig. 3. Relationship between in situ chlorophyll-a concentration (Chi) and (a) reflectance ratio R 
and (b) color index (Cl). The highlighted points emphasize those corresponding to Cl < -0.0005, 
where the corresponding data collection locations are shown in the inset map. Note that the 
minimum Chi in this dataset is about 0.02 mg m' 3 . In (a), the RMS error is estimated between 
measured and OC4v6 predicted Chi. If a best fit from all data points for Cl < -0.0005 sr" 1 is used, 
RMS error is reduced to 22.95%. Statistics are presented in Table 1 . 
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Fig. 4. Comparison between in situ Chi and satellite-based Chi for SeaWiFS. The satellite Chi 
was derived from both the OC4v6 algorithm (empty circles) and OCI algorithm (dots). Note that 
for Chi > 0.4 mg m' 3 the results from the two algorithms were forced to be identical (Eq. 5). The 
locations of the in situ measurements for Chi < 0.25 mg m" 3 are shown in the corresponding map. 
The comparison statistics for low concentration (Chi < 0.25) are listed in Table 2. 
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Fig. 5. Relationship between the two backscattering terms in Eq. (9) with Chi. To show their 
relative magnitudes, the absolute values (x 1000) are shown here. Note that for Chi < 0.4 mg 
the water term dominates the numerator of Eq. (9). 
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Fig. 6. Chi algorithm sensitivity to independent changes of detrital particles and CDOM relative 
to phytoplankton, based on 816 model simulations for each Chi value (Eq. 6, 13-15). 
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Fig. 7. Chi algorithm sensitivity to independent changes of absorption of detrital particles and 
CDOM («dg) relative to Chi (a), and to independent changes of particular backscattering (b bp ) 
relative to Chi (b), based on model simulations for each Chi value (Eq. 6, 13-15). Note that in (b), 
the added simulation was for Chi = 0.4 (star symbols), when the errors in the Cl retrievals are 
shown to approach those of the OC4 retrievals. 
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Fig. 8. Errors in R rs (k) and Cl induced by SeaWiFS digitization-noise after applying the Gordon 
and Wang (1994a) atmospheric correction. Most of the errors are due to the impact of the small 
noise on the atmospheric correction bands in the near infrared, which extrapolate the atmospheric 
properties to the visible (Hu et al., 2001). These errors are approximately linear to changing 
wavelengths (a and b), and can thus be corrected to first order by the Cl algorithm (Eq. 3, Fig. 2), 
resulting in smaller errors in Cl (and Chlci, see Fig. 9). The model parameters are listed in (c). 
Results from other modeling scenarios are different, but the principles in reducing the noise- 
reduced errors using the Cl are the same. 
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Fig. 9. Error distribution in the retrieved Chi due to digitization-noise induced R rs (R) errors for a 
clear maritime atmosphere (Fig. 8). In situ R IS data for the input Chi concentrations (from 0.02 to 
0.4 mg m' ) were combined with the R rs (k) errors to estimate Chi, where the “true” Chi was 
determined from the input R rs data free of errors. The differences were used to determine the 
relative retrieval errors. Note that the Cl-based retrieval errors are independent of Chi 
concentrations. 
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Fig. 10. (a) Statistics of speckling error in SeaWiFS GAC images in 1998 (n=599) for a 20 x 20° 
region in the Sargasso Sea. The speckling error is defined as the relative difference between the 
original Level-2 Chi and a 3x3 median-filter smoothed Level-2 Chi, with the assumption that 
most noise-induced speckling errors are removed in the latter. Note that while the RMS errors in 
Chl 0 c 4 increase sharply with decreasing concentrations, RMS errors in Chl C i remain stable at a 
much lower level in the entire concentration range here. The overall patterns agree well with 
those from the model simulations (Fig. 9), suggesting that most of these speckling errors 
originate from digitization/noise (through atmospheric correction). The total number of valid 
pixels from each algorithm indicates that all Chl 0 c 4 ^ 0.02 mg m" 3 appear unrealistic due to 
primarily atmospheric correction artifacts, (b) Same as in (a), but data were extracted from 
MODIS/Aqua Level-2 images in 2002 (n=745) for a 20 x 20° subregion in the Southern Pacific. 


46 




Fig. 1 1 . Chi (mg m~ 3 ) time series derived from SeaWiFS GAC R rs (k) data using the OC4v6 
algorithm (top) and the Cl algorithm (bottom). Data were extracted from 3x3 pixels centered at 
24.5°N 55°W from the daily measurements. For any given image (date), only when more than 
half of the pixels (in this case, >5 pixels) contained valid data (i.e., not associated with any 
suspicious flags) were statistics estimated. 
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Fig. 12. Chi (mg m~ 3 ) time series derived from SeaWiFS GAC /? rs (X) data using the OC4v6 
algorithm (top) and the Cl algorithm (bottom). Data were first extracted from 3x3 pixels centered 
at 24.5°N 55°W from the daily measurements. For any given image (date), only when more than 
half of the pixels (in this case, >5 pixels) contained valid data (i.e., not associated with any 
suspicious flags) were statistics estimated. The daily data were then averaged for the calendar 
month to construct the monthly time series. Note that SeaWiFS was not continuously operational 
after 2005 due to instrument operations. 
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Fig. 13. Comparison between SeaWiFS Level-2 Chloc 4 (a) and Chloci (b) over the western 
North Atlantic Ocean. SeaWiFS data were collected on 1 June 2004 (17:15 GMT) and 
processed with SeaDAS6.1 . The Level-2 quality-control flags were turned off to show the 
circulation features. Note that some eddy features are clearly revealed in the Chloci image but 
absent in the Chloc 4 image due to noise and residual errors in atmospheric correction and other 
corrections. 
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Fig. 14. MODIS/Aqua Level 2 Chloc 3 and Chloci derived from a subregion in the South 
Pacific Gyre (about 2200 x 440 km centered at 25.2°S 110.8°W) on 4 March 2003, 21:10 
GMT. (a) and (c) show the default Chloc 3 when the quality control flags are on and off, 
respectively, (b) and (d) are the corresponding Chloci images. 
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Fig. 15. Comparison between MERIS full-resolution (FR) Chl 0 c 3 (a) and Chloci (b) over the 
western North Atlantic Ocean. MERIS data were collected on 7 May 2011 (15:21 GMT) and 
processed with SeaDAS6.1. Note that most speckling and vertical striping noise in the Chl 0 c 3 
image has been removed in the Chloci image, where several eddy and circulation features can 
be better observed. Further, although the same algorithm coefficients for SeaWiFS were used, 
Chloci values in offshore water appear to be closer than Chloc 3 to those from SeaWiFS for the 
same region during similar periods (Fig. 13). 
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Fig. 16. Comparison between CZCS Level-2 Chl 0 c 2 (a) and Chloci (b) over the western North 
Atlantic Ocean (about 30° - 36°N, 70° - 60°W). CZCS data were collected on 31 July 1983 
(16:02 GMT) and processed with SeaDAS6.1 . Note that all eddy and circulation features in the 
Chloci image are completely absent in the Chl 0 c 2 image. 
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MODISA(ACI03M)/SeaWiFS(SCI03M) Chl a Ratios for Oligotrophic 



Fig. 17. Mean Chi ratio over global oligotrophic oceans between MODIS/Aqua and 
SeaWiFS estimates using the OCx (blue) and Cl (black) algorithms. Here “oligotrophic” 

_o 

is defined as all 9-km pixels with SeaWiFS mission mean Chi <0.1 mg m' . 
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Fig. 18. Chi distribution in the global deep oceans (> 200 m) during November 2006, as 
derived from SeaWiFS (black) and MODIS/Aqua (red) measurements. Results in (a) are 
from the OCx band-ratio algorithms, and in (b) are from the Cl algorithm (blended with 
the OCx algorithms for Chi > 0.25 mg m~ 3 ). Note the offset of 0.01 - 0.02 mg m~ 3 in the 
global mean and median values between (a) and (b). Results from other months of 2006 
show similar improvements in histogram consistency. 
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